A Generalized Goal in Restricted Subset Selection Theory


by 

Thomas J. Santner
Cornell  University 


SUMMARY 

A class of multiple decision procedures which selects a random size subset of populations not exceeding m (determined a priori) in size has previously been considered for the problem of selecting the best population among k candidates. The present paper generalizes the earlier results to the goal of selecting at least one of the t best populations. Applications of the basic theory can be made to many specific problems considered in the literature. In the regular case these include selection from univariate normal populations for means and variances, from gamma populations for scale factors and from noncentral chi square or noncentral F populations for noncentrality parameters, and in the nonregular case selection from uniform populations with different supports.

Some key words: Ranking and selection; Multiple decision procedure; Restricted subset selection; Normal means; Noncentral F populations.


1.  Introduction  and  Summary 


Let $(\mathcal{X}, \mathcal{B}, P_i)$, $i = 1,\ldots,k$ be k probability spaces which, as is usual in the literature, shall be referred to as populations and denoted by $\pi_1,\ldots,\pi_k$. For this paper $\mathcal{X}$ is a finite dimensional Euclidean space, $\mathcal{B}$ is its Borel sigma field and $P_i$ is an unknown probability measure from $\mathcal{P}$, a specified parametric family of probability measures on $(\mathcal{X}, \mathcal{B})$. The worth of each $\pi_i$ is characterized by an unknown scalar $\lambda_i = \lambda(P_i) \in \Lambda$, a known interval on the real line. Let $\lambda_{[1]} \le \cdots \le \lambda_{[k]}$ be the ordered $\lambda_i$'s, $\pi_{(i)}$ the (unknown) population with parameter $\lambda_{[i]}$ and $\Omega = \{\boldsymbol{\lambda} = (\lambda_1,\ldots,\lambda_k) \mid \lambda_i \in \Lambda \text{ for all } i\}$ the space of all possible underlying configurations of $\lambda_i$'s.

Historically the first approach to this problem was the indifference zone formulation of Bechhofer (1954) in which the experimenter selects a fixed number of populations and his goal is guaranteed to be met with at least probability P* whenever the true $\boldsymbol{\lambda}$ lies outside some subset or zone of indifference of the parameter space $\Omega$. In contrast to the indifference zone approach Gupta (1956, 1965) proposed the subset selection formulation in which a random size (between one and k) subset of the k populations is selected such that there is at least probability P* of achieving the experimenter's goal no matter what the actual $\boldsymbol{\lambda} \in \Omega$. The specific goal studied in this paper is selection of at least one of the t best populations $\pi_{(k-t+1)},\ldots,\pi_{(k)}$. Previous work on this same topic includes Mahamunulu (1967), Desu and Sobel (1968), Sobel (1969) and Panchapakesan (1969). It is assumed there is no a priori knowledge of the correct pairing of the $\pi_i$ and $\pi_{(j)}$.
A class of single sample restricted subset selection procedures is proposed for this goal which gives more flexibility to the experimenter than does either the fixed subset size or the random subset size procedures by allowing him to specify, before experimentation, an upper bound m on the number of populations included in the selected subset. However, should the data clearly indicate that a particular population is one of the t best, this rule retains the advantage of the subset selection procedure over the fixed size subset rule in allowing selection of fewer than m populations. Restricted subset selection procedures were introduced for the normal means problem in Gupta and Santner (1973). The notation and the rule R(n) of the present paper follow Santner (1974) where a general theory for the goal t = 1 was given. Hence, as will be noted in Section 4, several properties of the proposed rule can be found in the earlier paper. Both subset selection ($t > k-m$) and indifference zone ($t \le k-m$) probability requirements will be studied.

In Section 2 the problem will be formalized and a class of procedures proposed for its solution. In Section 3 the infimum of the probability of a correct selection is simultaneously derived for both the subset selection and indifference zone cases. The location or scale parameter problems are seen to yield a one stage minimization and sufficient conditions are given to allow its derivation in the general case. Section 4 gives properties of the proposed rule while Section 5 gives applications of the theory to selection from normal, noncentral F and uniform populations. Tables are included to allow implementation of the normal means procedure.
2. Formulation of the Problem


Each $\pi_i$ yields iid observations $\{X_{ij}\}_{j=1}^{n}$ which are also independent between populations. It is assumed there exists a sequence of Borel measurable functions $\{T_n\}$ with $T_n: \mathcal{X}^n \to \Lambda$ and such that $T_{in} = T_n(X_{i1},\ldots,X_{in})$ converges in probability to a monotone function (independent of i) of $\lambda_i$ as $n \to \infty$. Also $T_{in}$ has cdf $G_n(y|\lambda_i)$ with support $E_n^{\lambda_i}$ which is absolutely continuous with pdf $g_n(y|\lambda_i)$. For each n, $\{G_n(y|\lambda) \mid \lambda \in \Lambda\}$ is assumed to be a stochastically increasing family. Let $p: \Lambda \to R$ and $h_n: \bigcup_{\lambda} E_n^{\lambda} \to R$ satisfy respectively (2.1) and (2.2) of Santner (1974), the main conditions of which are $p(\lambda) < \lambda$ for all $\lambda \in \Lambda$; $h_n(x) > x$ for all n and $h_n(x) \to x$ as $n \to \infty$ for all x.
n 

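An indifference zone will be defined in $\Omega$ by means of the function $p(\lambda)$. Let

$$\Omega_t(p) = \{\boldsymbol{\lambda} \in \Omega \mid \lambda_{[k-t]} \le p(\lambda_{[k-t+1]})\}$$

$$\Omega_t^0(p) = \{\boldsymbol{\lambda} \mid \lambda_{[1]} = \lambda_{[k-t]} = p(\lambda_{[k-t+1]}),\ \lambda_{[k-t+1]} = \lambda_{[k]}\}$$

$$\Omega^0 = \{\boldsymbol{\lambda} \mid \lambda_{[1]} = \lambda_{[k]}\}.$$

For each $\{h_n(x)\}$ define the rule

$$R(n):\ \text{Select } \pi_i \iff T_{in} \ge \max\{T_{[k-m+1]},\ h_n^{-1}(T_{[k]})\} \tag{2.1}$$

where $T_{[1]} \le \cdots \le T_{[k]}$ are the ordered estimators and the n has been suppressed from $T_{[i]n}$ for ease of notation.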

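Example 2.1. For $p(\lambda) = \lambda - \delta$, $\delta > 0$, and $h_n(x) = x + d/\sqrt{n}$,

$$\Omega_t(p) = \{\boldsymbol{\lambda} \mid \lambda_{[k-t+1]} - \lambda_{[k-t]} \ge \delta\}$$

$$\Omega_t^0(p) = \{\boldsymbol{\lambda} \mid \lambda_{[1]} + \delta = \lambda_{[k-t]} + \delta = \lambda_{[k-t+1]} = \lambda_{[k]}\}$$

$$R(n):\ \text{Select } \pi_i \iff T_{in} \ge \max\{T_{[k-m+1]},\ T_{[k]} - d/\sqrt{n}\}$$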
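The rule (2.1) is easy to state operationally. The following is a minimal Python sketch, not taken from the paper: the function names `restricted_subset` and `location_h_inv` are hypothetical, indices are 0-based, and the inverse of $h_n$ is passed in as a callable so that the same code covers Example 2.1 as well as other choices of $h_n$.

```python
from typing import Callable, List, Sequence


def restricted_subset(T: Sequence[float], m: int,
                      h_inv: Callable[[float], float]) -> List[int]:
    """Return the (0-based) indices i with T[i] >= max(T_[k-m+1], h_inv(T_[k]))."""
    k = len(T)
    if not 1 <= m <= k:
        raise ValueError("m must satisfy 1 <= m <= k")
    ordered = sorted(T)
    # ordered[k - m] is T_[k-m+1] (the m-th largest); ordered[k - 1] is T_[k].
    cutoff = max(ordered[k - m], h_inv(ordered[k - 1]))
    return [i for i, value in enumerate(T) if value >= cutoff]


def location_h_inv(d: float, n: int) -> Callable[[float], float]:
    """Inverse of h_n(x) = x + d / sqrt(n), the choice made in Example 2.1."""
    return lambda x: x - d / n ** 0.5


# Illustrative sample means (made-up numbers), k = 5 populations, n = 25.
means = [0.10, 0.90, 0.35, 1.00, 0.20]
print(restricted_subset(means, m=3, h_inv=location_h_inv(d=1.0, n=25)))   # [1, 3]
print(restricted_subset(means, m=1, h_inv=location_h_inv(d=1.0, n=25)))   # [3]
print(restricted_subset(means, m=3, h_inv=location_h_inv(d=10.0, n=25)))  # [1, 2, 3]
```

With a small d the cutoff is driven by the largest statistic and fewer than m populations are retained; with a large d the cap m takes over.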


Goal I. Given P*, $p(\lambda)$, t and R(n) find the smallest common sample size n needed to satisfy the probability requirement

$$P_{\boldsymbol{\lambda}}[CS|R(n)] \ge P^* \quad \text{for all } \boldsymbol{\lambda} \in \Omega_t(p). \tag{2.2}$$

The event [CS|R(n)] occurs iff the selected subset contains at least one of $\pi_{(k-t+1)},\ldots,\pi_{(k)}$.

Similarly given P*, n and $t > k-m$ a subset selection probability requirement (over all $\Omega$) can be achieved by use of a class of rules R(h) defined for $h \in H = \{h_d: \bigcup_{\lambda} E_n^{\lambda} \to R \mid d \in [0,\infty)\}$ where

(1) $h_d(x) \ge x$ for all x, d

(2) $h_0(x) = x$

(3) For every x, $h_d(x)$ is continuous in d

(4) For every x, $h_d(x) \to \infty$ as $d \to \infty$.

For each $h = h_d \in H$, R(h) is defined by (2.1) using h rather than $h_n$.


Goal S. Given P*, n, t find $h \in H$ such that

$$P_{\boldsymbol{\lambda}}[CS|R(h)] \ge P^* \quad \text{for all } \boldsymbol{\lambda} \in \Omega. \tag{2.3}$$

Remark 2.1. Intuitively the subset selection approach is applicable since, when $t > k-m$, selection of m populations guarantees that a correct selection has occurred and hence (2.3) holds.
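The counting fact behind Remark 2.1 can be checked by brute force. The sketch below is illustrative only (the helper name `always_correct` is hypothetical): it enumerates every subset of size m and tests whether it meets the labels of the t best populations.

```python
from itertools import combinations


def always_correct(k: int, m: int, t: int) -> bool:
    """True if every size-m subset of {1..k} contains one of the t best labels."""
    best = set(range(k - t + 1, k + 1))
    return all(best & set(subset) for subset in combinations(range(1, k + 1), m))


print(always_correct(k=6, m=3, t=4))  # True:  t > k - m
print(always_correct(k=6, m=3, t=3))  # False: t = k - m, {1, 2, 3} misses the best
```

This is why the subset selection requirement over all of $\Omega$ is only meaningful for $t > k-m$, while $t \le k-m$ calls for an indifference zone.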


3.  Probability  of  a  Correct  Selection 


Since the form of the rules R(n) and R(h) are the same and a correct selection occurs for either iff at least one of the t best populations is selected the following result is applicable to both. Let $\mathcal{D} = \{k-t+1,\ldots,k\}$ and $\mathcal{D}^c = \{1,\ldots,k-t\}$.


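Theorem 3.1. $P_{\boldsymbol{\lambda}}[CS|R]$ is

$\uparrow$ in $\lambda_{[i]}$ for any $i \in \mathcal{D}$ when all other $\lambda_{[j]}$'s are fixed, and

$\downarrow$ in $\lambda_{[i]}$ for any $i \in \mathcal{D}^c$ when all other $\lambda_{[j]}$'s are fixed.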

Proof. As usual let $P_{\boldsymbol{\lambda}}[CS|R] = E_{\boldsymbol{\lambda}}[\eta(\mathbf{T})]$ where $T_{(j)}$ denotes the statistic from $\pi_{(j)}$ and

$$\eta(\mathbf{T}) = \begin{cases} 1, & T_{(i)} \ge \max\{T_{[k-m+1]},\ h^{-1}(T_{[k]})\} \text{ for some } i \in \mathcal{D} \\ 0, & \text{otherwise.} \end{cases}$$

Then from a lemma of Mahamunulu (1967) and Alam and Rizvi (1966) it suffices to show $\eta(\mathbf{T}) \uparrow$ in $T_{(i)}$ for any $i \in \mathcal{D}$ when all other $T_{(j)}$'s are fixed and $\eta(\mathbf{T}) \downarrow$ in $T_{(i)}$ for any $i \in \mathcal{D}^c$ when all other $T_{(j)}$'s are fixed. To prove the former pick any $\mathbf{T}$ with $\eta(\mathbf{T}) = 1$, fix $i \in \mathcal{D}$ and choose $\mathbf{T}'$ having $T'_{(i)} \ge T_{(i)}$ and $T'_{(j)} = T_{(j)}$ for all $j \ne i$. Now $\eta(\mathbf{T}) = 1$ implies there exists $j_0 \in \mathcal{D}$ with $h(T_{(j_0)}) \ge T_{[k]}$ and $T_{(j_0)} \ge T_{[k-m+1]}$.

Subcase 1. $i = j_0$. It can be seen that $T'_{(j_0)} \ge T_{(j_0)}$ implies $T'_{(j_0)} \ge T'_{[k-m+1]}$ and $h(T'_{(j_0)}) \ge T'_{[k]}$, hence $\eta(\mathbf{T}') = 1$.

Subcase 2. $i \ne j_0$. Again straightforward arguments show that either

(1) $T'_{(j_0)} \ge \max\{T'_{[k-m+1]},\ h^{-1}(T'_{[k]})\}$

or

(2) $T'_{(i)} \ge \max\{T'_{[k-m+1]},\ h^{-1}(T'_{[k]})\}$;

hence $\eta(\mathbf{T}') = 1$. The second part of the proof follows along the same lines.

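Hence the infimum of the probability of a correct selection occurs when for all $i \in \mathcal{D}^c$, $\lambda_{[i]}$ is as large as possible and when for all $i \in \mathcal{D}$, $\lambda_{[i]}$ is as small as possible.

Corollary 3.1. For $t \le k-m$, $\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] = \inf_{\Omega_t^0(p)} P_{\boldsymbol{\lambda}}[CS|R(n)]$ and for $t > k-m$, $\inf_{\Omega} P_{\boldsymbol{\lambda}}[CS|R(h)] = \inf_{\Omega^0} P_{\boldsymbol{\lambda}}[CS|R(h)]$.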


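These last two probabilities can be expressed as follows. Let

$$I(y; a, b) = \frac{\Gamma(a+b)}{\Gamma(a)\Gamma(b)} \int_0^y u^{a-1}(1-u)^{b-1}\,du$$

denote the incomplete beta function with parameters a and b. Also for all $\boldsymbol{\lambda}^* \in \Omega_t^0(p)$ $\exists\ \lambda \in \Lambda' = \{\lambda \in \Lambda \mid p(\lambda) \in \Lambda\}$ with $\boldsymbol{\lambda}^* = (p(\lambda),\ldots,p(\lambda),\lambda,\ldots,\lambda)$ and then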
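$$P_{\boldsymbol{\lambda}^*}[CS|R(n)] = P[Z \ge \max\{T_{[k-m+1]},\ h_n^{-1}(T_{[k]})\}], \quad \text{where } Z = \max\{T_{(k-t+1)},\ldots,T_{(k)}\},$$

$$= \sum_{i=k-m+1}^{k} P[Z = T_{[i]},\ h_n(T_{[i]}) \ge T_{[k]}]$$

since $t \le k-m \Rightarrow [Z \ge T_{[k-m+1]}] = \bigcup_{i=k-m+1}^{k} [Z = T_{[i]}]$,

$$= \sum_{i=k-m+1}^{k} \sum_{j=k-t+1}^{k} P[Z = T_{(j)} = T_{[i]},\ h_n(T_{(j)}) \ge T_{[k]}].$$

But

$$[Z = T_{(j)} = T_{[i]}] = [T_{(j)} > T_{(\ell)} \text{ for } (i-t)\ \ell\text{'s with } \ell \in \mathcal{D}^c;\ T_{(j)} < T_{(\ell)} \text{ for } (k-i)\ \ell\text{'s with } \ell \in \mathcal{D}^c;\ T_{(j)} > T_{(\ell)}\ \forall\ \ell \in \mathcal{D}\setminus\{j\}]$$

$$= \bigcup_{\nu=1}^{\binom{k-t}{i-t}} [T_{(\ell)} < T_{(j)}\ \forall\ \ell \in S_\nu^i \cup \mathcal{D}\setminus\{j\};\ T_{(j)} < T_{(\ell)}\ \forall\ \ell \in \bar{S}_\nu^i],$$

where $\{S_\nu^i\}$ is the collection of all subsets of size $(i-t)$ from $\mathcal{D}^c$ and $\bar{S}_\nu^i = \mathcal{D}^c \setminus S_\nu^i$. So

$$P_{\boldsymbol{\lambda}^*}[CS|R(n)] = \sum_{i=k-m+1}^{k} \sum_{j=k-t+1}^{k} \sum_{\nu=1}^{\binom{k-t}{i-t}} P[T_{(\ell)} < T_{(j)}\ \forall\ \ell \in S_\nu^i \cup \mathcal{D}\setminus\{j\};\ T_{(j)} < T_{(\ell)} < h_n(T_{(j)})\ \forall\ \ell \in \bar{S}_\nu^i]$$

$$= \sum_{i=k-m+1}^{k} t \binom{k-t}{i-t} \int \{G_n(y|\lambda)\}^{t-1} \{G_n(h_n(y)|p(\lambda)) - G_n(y|p(\lambda))\}^{k-i} \{G_n(y|p(\lambda))\}^{i-t}\, dG_n(y|\lambda) \tag{3.1}$$

$$= \beta(\lambda,t,n)$$

where

$$\beta(\lambda,t,n) = \int_{E_n^{\lambda}} \{G_n(h_n(y)|p(\lambda))\}^{k-t}\ I\!\left(\frac{G_n(y|p(\lambda))}{G_n(h_n(y)|p(\lambda))};\ k-m-t+1,\ m\right) d\{G_n(y|\lambda)\}^t.$$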
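The passage from the sum over ranks in (3.1) to the incomplete beta form of $\beta(\lambda,t,n)$ rests on the binomial tail identity $\sum_{r=a}^{N} \binom{N}{r} x^r (1-x)^{N-r} = I(x; a, N-a+1)$, applied with $N = k-t$ and $x = G_n(y|p(\lambda))/G_n(h_n(y)|p(\lambda))$. The following Python sketch checks this numerically; the function names are hypothetical and the values of A and B simply stand in for the two cdf values at a fixed y.

```python
from math import comb, gamma


def inc_beta_int(x: float, a: int, b: int) -> float:
    """Regularized incomplete beta I(x; a, b), integers a, b >= 1 (binomial tail)."""
    n = a + b - 1
    return sum(comb(n, j) * x**j * (1.0 - x)**(n - j) for j in range(a, n + 1))


def inc_beta_quad(x: float, a: int, b: int, steps: int = 20000) -> float:
    """The same function by midpoint-rule integration of the beta density."""
    const = gamma(a + b) / (gamma(a) * gamma(b))
    h = x / steps
    return const * h * sum(((i + 0.5) * h)**(a - 1) * (1.0 - (i + 0.5) * h)**(b - 1)
                           for i in range(steps))


def rank_sum(A: float, B: float, k: int, m: int, t: int) -> float:
    """sum_{i=k-m+1}^{k} C(k-t, i-t) A^(i-t) (B-A)^(k-i), the integrand of (3.1) without
    the factor t G^(t-1)."""
    return sum(comb(k - t, i - t) * A**(i - t) * (B - A)**(k - i)
               for i in range(k - m + 1, k + 1))


def beta_form(A: float, B: float, k: int, m: int, t: int) -> float:
    """B^(k-t) I(A/B; k-m-t+1, m), the corresponding integrand of beta(lambda, t, n)."""
    return B**(k - t) * inc_beta_int(A / B, k - m - t + 1, m)


# A = G_n(y | p(lambda)) and B = G_n(h_n(y) | p(lambda)) at some fixed y, with A <= B.
print(rank_sum(0.3, 0.7, k=8, m=3, t=2), beta_form(0.3, 0.7, k=8, m=3, t=2))
print(inc_beta_int(0.4, 4, 3), inc_beta_quad(0.4, 4, 3))
```

The two numbers on each printed line should agree up to rounding or quadrature error.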


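Hence for $t \le k-m$

$$\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] = \inf_{\lambda \in \Lambda'} \beta(\lambda,t,n). \tag{3.2}$$

Similar arguments show for $t > k-m$

$$\inf_{\Omega} P_{\boldsymbol{\lambda}}[CS|R(h)] = \inf_{\lambda \in \Lambda} \gamma(\lambda,t,h) \tag{3.3}$$

where

$$\gamma(\lambda,t,h) = \int_{E_n^{\lambda}} \{G_n(h(y)|\lambda)\}^{k-t}\, d\{G_n(y|\lambda)\}^t.$$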
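As a hedged illustration of (3.3), consider a standard normal location family with $h(y) = y + d$ (an assumption made here for the example; any scale factor is absorbed into d). Then $\gamma$ does not depend on $\lambda$ and reduces to $\int \Phi(y+d)^{k-t}\, d\Phi(y)^t$. The Python sketch below evaluates it by Simpson's rule; the name `gamma_normal` is hypothetical.

```python
from math import erfc, exp, pi, sqrt


def Phi(x: float) -> float:
    """Standard normal cdf."""
    return 0.5 * erfc(-x / sqrt(2.0))


def phi(x: float) -> float:
    """Standard normal pdf."""
    return exp(-0.5 * x * x) / sqrt(2.0 * pi)


def gamma_normal(k: int, t: int, d: float, steps: int = 2000) -> float:
    """Integral of Phi(y + d)^(k - t) d[Phi(y)^t] over the real line (Simpson's rule)."""
    lo, hi = -9.0, 9.0

    def integrand(y: float) -> float:
        return Phi(y + d) ** (k - t) * t * Phi(y) ** (t - 1) * phi(y)

    h = (hi - lo) / steps
    total = integrand(lo) + integrand(hi)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * integrand(lo + i * h)
    return total * h / 3.0


# With d = 0 only the largest statistic is kept, so the value is t / k.
print(gamma_normal(k=5, t=4, d=0.0))
# Increasing d raises the guaranteed probability towards 1.
print(gamma_normal(k=5, t=4, d=1.0), gamma_normal(k=5, t=4, d=12.0))
```

In practice one would increase d until the value reaches P*, which is the content of Goal S for this family.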

Remark 3.1. If the selection problem is a location or scale parameter problem then a change of variables shows $\beta(\lambda,t,n)$ and $\gamma(\lambda,t,h)$ are independent of $\lambda$ and hence (3.2) and (3.3) are completely evaluated. The normal means problem will provide an example.

When $\beta(\lambda,t,n)$ depends on $\lambda$ and $G_n(y|\lambda)$ and $p(\lambda)$ satisfy regularity conditions (3.3) of Santner (1974) then the following conditions imply $\beta(\lambda,t,n)$ is monotone and give the evaluation of (3.2). This result generalizes Theorem 3.2 of the earlier paper.

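Theorem 3.2. If all derivatives in (3.5) and (3.6) exist and for all $\lambda \in \Lambda'$

$$E_n^{\lambda} = E_n \tag{3.4}$$

$$g_n(y|\lambda)\,\frac{\partial G_n(h_n(y)|p(\lambda))}{\partial \lambda} - h_n'(y)\, g_n(h_n(y)|p(\lambda))\,\frac{\partial G_n(y|\lambda)}{\partial \lambda} \ge 0 \quad \text{a.e.} \tag{3.5}$$

$$g_n(y|\lambda)\,\frac{\partial G_n(y|p(\lambda))}{\partial \lambda} - g_n(y|p(\lambda))\,\frac{\partial G_n(y|\lambda)}{\partial \lambda} \ge 0 \quad \text{a.e.} \tag{3.6}$$

then $\beta(\lambda,t,n)$ is nondecreasing in $\lambda$.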

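The proof of the result follows from an application of Theorem 2.1 of Gupta and Panchapakesan (1972) where, in their notation, we have taken

$$F_\lambda(y) = \{G_n(y|\lambda)\}^t$$

$$\psi(y,\lambda) = \{G_n(h_n(y)|p(\lambda))\}^{k-t}\ I\!\left(\frac{G_n(y|p(\lambda))}{G_n(h_n(y)|p(\lambda))};\ k-m-t+1,\ m\right).$$

Hence if there is a smallest $\lambda_0 \in \Lambda'$ and (3.4)-(3.6) hold then $\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] = \beta(\lambda_0,t,n)$.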


Remark 3.2. If (3.5) and (3.6) are both nonpositive then $\beta(\lambda,t,n)$ is nonincreasing and (3.2) can again be evaluated.


Remark 3.3. If (3.4) and (3.6) hold for all $\lambda \in \Lambda$ with $h_n(y)$ replaced by $h(y)$ then $\gamma(\lambda,t,h)$ is nondecreasing and hence (3.3) can be computed. Panchapakesan (1969) gives this result for the case m = k.


4.  Properties  of  the  Rules 

When $t \le k-m$ we establish the consistency of any sequence of rules $\{R(n)\}$ in the sense that given any $(P^*, p(\lambda))$ requirement there exists a sample size n which attains (2.2).

Theorem 4.1. When $t \le k-m$ if there exists $\lambda_0 \in \Lambda'$ and $N \ge 1$ such that

$$\inf_{\Lambda'} \beta(\lambda,t,n) = \beta(\lambda_0,t,n) \quad \text{for all } n \ge N \tag{4.1}$$

then $\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] \to 1$ as $n \to \infty$.


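Proof. From Section 3 and (4.1) it suffices to show $\beta(\lambda_0,t,n) \to 1$ as $n \to \infty$. Since $p(\lambda_0) < \lambda_0$ $\exists\ a \ni p(\lambda_0) < a < \lambda_0$. Let

$$f_{n1i}(y) = \{G_n(y|\lambda_0)\}^{t-1}$$

$$f_{n2i}(y) = \{G_n(y|p(\lambda_0))\}^{i-t}$$

$$f_{n3i}(y) = \{G_n(h_n(y)|p(\lambda_0)) - G_n(y|p(\lambda_0))\}^{k-i}$$

then by (3.1) if

$$\int \prod_{j=1}^{3} f_{nji}(y)\, dG_n(y|\lambda_0) \to 0 \text{ for } i < k \quad \text{and} \quad \to 1/t \text{ for } i = k$$

as $n \to \infty$ the result holds.

Case A. $k-m+1 \le i \le k-1$. Given $0 < \epsilon < 1$ $\exists\ M \ge N \ni$

$$G_n(a|\lambda_0) < \epsilon/2$$

$$G_n(a|p(\lambda_0)) > 1 - \epsilon/2$$

for $n \ge M$ since $T_{in} \to \lambda_0$ in probability under $\lambda_0$ (and $T_{in} \to p(\lambda_0)$ under $p(\lambda_0)$) as $n \to \infty$. So for all $y \ge a$

$$f_{n3i}(y) \le \{G_n(h_n(y)|p(\lambda_0)) - G_n(y|p(\lambda_0))\} \le 1 - G_n(a|p(\lambda_0)) < \epsilon/2 \quad \text{for all } n \ge M.$$

Hence for all $n \ge M$,

$$\int \prod_{j=1}^{3} f_{nji}(y)\, dG_n(y|\lambda_0) \le \int_{-\infty}^{a} 1\, dG_n(y|\lambda_0) + \int_{a}^{\infty} \epsilon/2\, dG_n(y|\lambda_0) < \epsilon.$$

Case B. $i = k$. Since $f_{n3k}(y) \equiv 1$ and $t \int \{G_n(y|\lambda_0)\}^{t-1}\, dG_n(y|\lambda_0) = 1$, we have $t \int \prod_{j=1}^{2} f_{njk}(y)\, dG_n(y|\lambda_0) \le 1$. Given $0 < \epsilon' < 1$ choose $\epsilon \ni (1-\epsilon)^{k-t}(1-\epsilon^t) = 1 - \epsilon'$ and $M \ge N \ni$ for all $n \ge M$, $G_n(a|\lambda_0) < \epsilon$ and $G_n(a|p(\lambda_0)) > 1 - \epsilon$. So for all $n \ge M$,

$$1 \ge t \int_{-\infty}^{\infty} \prod_{j=1}^{2} f_{njk}(y)\, dG_n(y|\lambda_0) \ge t \int_{a}^{\infty} \prod_{j=1}^{2} f_{njk}(y)\, dG_n(y|\lambda_0)$$

$$\ge t \int_{a}^{\infty} \{G_n(y|\lambda_0)\}^{t-1} (1-\epsilon)^{k-t}\, dG_n(y|\lambda_0)$$

$$= (1-\epsilon)^{k-t}\,[1 - G_n(a|\lambda_0)^t]$$

$$\ge (1-\epsilon)^{k-t}(1-\epsilon^t) = 1 - \epsilon'$$

and completes the proof.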


Remark 4.1. Similarly if there exists $\lambda_0 \in \Lambda$ such that $\inf_{\Lambda} \gamma(\lambda,t,h) = \gamma(\lambda_0,t,h)$ for all $h \in H$ then $\lim_{d \to \infty} \inf_{\Omega} P_{\boldsymbol{\lambda}}[CS|R(h)] = 1$. Hence (2.3) can always be achieved by a rule in the class $\{R(h) \mid h \in H\}$.

As mentioned in Section 1 since monotonicity properties of R(n) and other indicators of its performance such as the number of selected populations, S(n); its expected value, $E_{\boldsymbol{\lambda}}[S(n)]$; $\sup_{\Omega} E_{\boldsymbol{\lambda}}[S(n)]$ and convergence properties of $\{S(n)\}$ do not depend on the goal, these results can be found in Santner (1974).


5. Applications


This  section  contains  several  examples  of  applications  of  the  results 
of  this  paper  to  selection  from  specific  populations. 

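I. Suppose $\pi_i \sim N(\mu_i, \sigma^2)$, $i = 1,\ldots,k$ where the common $\sigma^2$ is known and the experimenter is interested in selecting at least one of the t populations with largest $\mu_i$'s. Hence $\lambda_i = \mu_i$ and taking $T_{in} = \bar{X}_i = \frac{1}{n} \sum_{j=1}^{n} X_{ij}$ gives $G_n(y|\mu_i) = \Phi(\sqrt{n}\,(y-\mu_i)/\sigma)$ where $\Phi$ is the cdf of a N(0,1) random variable. Since $\{G_n(y|\mu)\}$ is a location parameter family we take $p(\mu) = \mu - \delta$ and $h_n(x) = x + d\sigma/\sqrt{n}$, so that

$$\Omega_t(p) = \{\boldsymbol{\mu} \mid \mu_{[k-t+1]} - \mu_{[k-t]} \ge \delta\}$$

$$R(n):\ \text{Select } \pi_i \iff \bar{X}_i \ge \max\{\bar{X}_{[k-m+1]},\ \bar{X}_{[k]} - d\sigma/\sqrt{n}\}. \tag{5.1}$$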


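Applying (3.2) and a change of variables gives

$$\inf_{\Omega_t(p)} P_{\boldsymbol{\mu}}[CS|R(n)] = \int_{-\infty}^{\infty} \left\{\Phi\!\left(y + d + \frac{\sqrt{n}\,\delta}{\sigma}\right)\right\}^{k-t} I\!\left(\frac{\Phi(y + \sqrt{n}\,\delta/\sigma)}{\Phi(y + d + \sqrt{n}\,\delta/\sigma)};\ k-m-t+1,\ m\right) d\{\Phi(y)\}^t. \tag{5.2}$$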
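The right-hand side of (5.2) depends on n, $\delta$ and $\sigma$ only through $c = \sqrt{n}\,\delta/\sigma$, so it can be tabulated as a function of (k, m, t, d, c). The sketch below is one way to evaluate it in Python with the standard library only; it is an illustration under stated assumptions, not the computation used for Table I. The names `inf_pcs_normal`, `inc_beta_int` and `simpson` are hypothetical, the incomplete beta function is computed through the binomial tail identity (valid because its parameters are integers here), and the integral is truncated to [-9, 9].

```python
from math import comb, erfc, exp, pi, sqrt


def Phi(x: float) -> float:
    """Standard normal cdf."""
    return 0.5 * erfc(-x / sqrt(2.0))


def phi(x: float) -> float:
    """Standard normal pdf."""
    return exp(-0.5 * x * x) / sqrt(2.0 * pi)


def inc_beta_int(x: float, a: int, b: int) -> float:
    """Regularized incomplete beta I(x; a, b) for integers a, b >= 1."""
    n = a + b - 1
    return sum(comb(n, j) * x**j * (1.0 - x)**(n - j) for j in range(a, n + 1))


def simpson(f, lo: float, hi: float, steps: int) -> float:
    """Composite Simpson rule on [lo, hi]; steps must be even."""
    h = (hi - lo) / steps
    total = f(lo) + f(hi)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * f(lo + i * h)
    return total * h / 3.0


def inf_pcs_normal(k: int, m: int, t: int, d: float, c: float,
                   steps: int = 2000) -> float:
    """Right-hand side of (5.2) with c = sqrt(n) * delta / sigma."""
    if not (m >= 1 and 1 <= t <= k - m):
        raise ValueError("need m >= 1 and 1 <= t <= k - m")

    def integrand(y: float) -> float:
        upper = Phi(y + d + c)
        ratio = min(1.0, Phi(y + c) / upper)
        weight = t * Phi(y) ** (t - 1) * phi(y)      # density of d{Phi(y)}^t
        return upper ** (k - t) * inc_beta_int(ratio, k - m - t + 1, m) * weight

    return simpson(integrand, -9.0, 9.0, steps)


# Sanity check: for m = 1 and c = 0 the value reduces to t / k.
print(inf_pcs_normal(k=5, m=1, t=2, d=1.0, c=0.0))
# The guaranteed probability increases with c = sqrt(n) * delta / sigma.
print([round(inf_pcs_normal(6, 2, 2, 1.2, c), 4) for c in (0.0, 0.5, 1.0, 2.0)])
```

For m = 1 the rule keeps only the largest sample mean, the incomplete beta factor collapses to a power, and the integral can be done in closed form, which gives an independent check on the quadrature.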


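From earlier work

$$\sup_{\Omega} E_{\boldsymbol{\mu}}[S(n)] = k \int_{-\infty}^{\infty} \{\Phi(y+d)\}^{k-1}\ I\!\left(\frac{\Phi(y)}{\Phi(y+d)};\ k-m,\ m\right) d\Phi(y). \tag{5.3}$$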


Setting the right-hand side of (5.3) equal to $1 + \epsilon$ and solving for d and then equating the right-hand side of (5.2) to P* gives the proper sample size required to guarantee both (2.2) and that the maximum expected number of selected populations is less than $1 + \epsilon$. Table I gives the values of $\sqrt{n}\,\delta/\sigma$ required to implement the rule for selected k, m and t.
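The two-step design described above (choose d from the expected subset size, then choose n from the probability requirement) can be sketched numerically. The Python code below is a hedged illustration only: it assumes the forms of (5.2) and (5.3) in which the incomplete beta function has integer parameters, uses bisection on quantities that increase in d and in $c = \sqrt{n}\,\delta/\sigma$ respectively, and all function names are hypothetical.

```python
from math import ceil, comb, erfc, exp, pi, sqrt


def Phi(x):
    return 0.5 * erfc(-x / sqrt(2.0))


def phi(x):
    return exp(-0.5 * x * x) / sqrt(2.0 * pi)


def inc_beta_int(x, a, b):
    """Regularized incomplete beta I(x; a, b) for integers a, b >= 1."""
    n = a + b - 1
    return sum(comb(n, j) * x**j * (1.0 - x)**(n - j) for j in range(a, n + 1))


def simpson(f, lo=-8.0, hi=8.0, steps=800):
    h = (hi - lo) / steps
    total = f(lo) + f(hi)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * f(lo + i * h)
    return total * h / 3.0


def sup_expected_size(k, m, d):
    """k * int Phi(y+d)^(k-1) I(Phi(y)/Phi(y+d); k-m, m) dPhi(y), as in (5.3); m < k."""
    def f(y):
        upper = Phi(y + d)
        return upper ** (k - 1) * inc_beta_int(min(1.0, Phi(y) / upper), k - m, m) * phi(y)
    return k * simpson(f)


def inf_pcs_normal(k, m, t, d, c):
    """Right-hand side of (5.2) with c = sqrt(n) * delta / sigma; 1 <= t <= k - m."""
    def f(y):
        upper = Phi(y + d + c)
        ratio = min(1.0, Phi(y + c) / upper)
        return (upper ** (k - t) * inc_beta_int(ratio, k - m - t + 1, m)
                * t * Phi(y) ** (t - 1) * phi(y))
    return simpson(f)


def solve_increasing(f, target, lo=0.0, hi=20.0, iters=50):
    """Bisection for f(x) = target, f increasing; returns lo if already satisfied."""
    if f(lo) >= target:
        return lo
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(mid) < target:
            lo = mid
        else:
            hi = mid
    return hi


def design(k, m, t, p_star, eps):
    """Return (d, c): d from sup E[S] = 1 + eps, then c from inf P[CS] = p_star."""
    d = solve_increasing(lambda x: sup_expected_size(k, m, x), 1.0 + eps)
    c = solve_increasing(lambda x: inf_pcs_normal(k, m, t, d, x), p_star)
    return d, c


def sample_size(c, sigma, delta):
    """Smallest n with sqrt(n) * delta / sigma >= c."""
    return ceil((c * sigma / delta) ** 2)


d, c = design(k=6, m=2, t=2, p_star=0.90, eps=0.5)
print(d, c, sample_size(c, sigma=1.0, delta=0.5))
```

The target $1 + \epsilon$ must lie between 1 and m, since the supremum in (5.3) ranges over that interval as d goes from 0 to infinity.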

II. Estimators, $T_{in}$, having noncentral chi square and noncentral F distributions arise in problems of selection from multivariate normal populations with known and unknown covariance matrices. See Gupta (1966), Alam and Rizvi (1966) and Gupta and Studden (1970). For an example of restricted subset selection from noncentral chi square populations with t = 1 see Santner (1973).

Suppose that $T_{in}$ has a noncentral F cdf with parameters p and q and noncentrality parameter $\lambda_i$ to be abbreviated $F_{p,q}(y|\lambda_i)$. Both p and q may depend on n but the dependence is suppressed for ease of notation. Here $\Lambda = [0,\infty)$, $G_n(y|\lambda) = F_{p,q}(y|\lambda)$ and $g_n(y|\lambda) = dF_{p,q}(y|\lambda)/dy = f_{p,q}(y|\lambda)$. Take

$$p(\lambda) = \begin{cases} \lambda - \delta_1, & \delta_1 \le \lambda \le \delta_1\delta_2/(\delta_2 - 1) \\ \lambda/\delta_2, & \lambda > \delta_1\delta_2/(\delta_2 - 1) \end{cases} \qquad (\delta_1 > 0,\ \delta_2 > 1)$$

and $h_n(x) = d^{1/n} x$, $d > 1$.

Hence

Table I. Lists values of $\sqrt{n}\,\delta/\sigma$ required to attain P* levels .75, .90 and .975 by rules (5.1) with d = .4(.4)1.6 for various k, m and t

                      d = .4                    d = .8
  k   m   t      .75     .90    .975       .75     .90    .975

  4   2   2             1.121   1.914                     1.585

  6   2   2             1.498   2.234              1.261   1.960
      3   2             1.457   2.203              1.089   1.828
      4   2             1.453   2.203              1.054   1.804

  8   2   2    1.080    1.701   2.406              1.492   2.164
          3     .656    1.250   1.921              1.025   1.664
      3   2    1.015    1.650   2.375              1.308   2.007
          3     .596    1.203   1.898               .848   1.523
      4   2    1.005    1.644   2.375              1.257   1.976

 10   2   2    1.230    1.837   2.531     1.061    1.644   2.304
          3     .335    1.410   2.062      .660    1.207   1.828
          4     .546    1.105   1.750      .361     .892   1.492
      3   2    1.162    1.781   2.492      .865    1.460   2.140
          3     .771    1.359   2.031      .462    1.023   1.671
          4     .484    1.058   1.718      .164     .712   1.343
      4   2    1.150    1.773   2.484      .785    1.396   2.093
          3     .759    1.351   2.031      .387     .966   1.632

 15   2   2    1.468    2.054   2.726     1.323    1.886   2.523
          3    1.103    1.652   2.281      .954    1.478   2.070
          4     .853    1.380   1.984      .701    1.199   1.765
      3   2    1.395    1.994   2.679     1.130    1.701   2.351
          3    1.031    1.593   2.234      .759    1.293   1.898
          4     .783    1.324   1.945      .505    1.011   1.601
      4   2    1.377    1.982   2.671     1.041    1.625   2.296
          3    1.015    1.582   2.234      .669    1.218   1.843
          4     .767    1.314   1.945      .481     .943   1.554

 20   2   2    1.619    2.195   2.851     1.486    2.039   2.664
          3    1.264    1.880   2.406     1.128    1.642   2.214
          4    1.029    1.539   2.125      .891    1.375   1.925
      3   2    1.543    2.128   2.796     1.298    1.853   2.492
          3    1.189    1.736   2.367      .937    1.457   2.039
          4     .953    1.476   2.085      .701    1.187   1.757
      4   2    1.521    2.115   2.789     1.204    1.771   2.421
          3    1.169    1.722   2.359      .842    1.375   1.984
          4     .934    1.466   2.078      .607    1.109   1.695

Table I (cont.)

                      d = 1.2                   d = 1.6
  k   m   t      .75     .90    .975       .75     .90    .975

  4   2   2      *       .605   1.351       *       .476   1.218

  6   2   2     .521    1.130   1.812      .464    1.070   1.750
      3   2     .181     .800   1.500      .009     .613   1.289
      4   2     .029     .675   1.414       *       .343   1.062

  8   2   2     .802    1.390   2.046      .763    1.347   2.000
          3     .356     .910   1.531      .310     .859   1.476
      3   2     .490    1.076   1.742      .377     .949   1.593
          3     .031     .587   1.218       *       .435   1.039
      4   2     .316     .921   1.617      .104     .687   1.343

 10   2   2     .984    1.558   2.203      .954    1.525   2.164
          3     .576    1.113   1.710      .543    1.074   1.664
          4     .270     .789   1.367      .231     .746   1.320
      3   2     .668    1.257   1.898      .601    1.160   1.781
          3     .271     .804   1.406      .173     .693   1.273
          4      *       .472   1.062       *       .343    .906
      4   2     .513    1.095   1.765      .351     .912   1.546
          3     .095     .648   1.281       *       .437   1.031

 15   2   2    1.263    1.820   2.445     1.242    1.796   2.414
          3     .892    1.406   1.796     2.870    1.380   1.945
          4     .636    1.125   1.761      .612    1.097   1.640
      3   2     .994    1.541   2.156      .935    1.474   2.078
          3     .617    1.125   1.687      .556    1.052   1.601
          4     .357     .835   1.382      .292     .757   1.281
      4   2     .824    1.377   2.007      .716    1.252   1.859
          3     .444     .960   1.539      .332     .826   1.375
          4     .183     .671   1.242      .062     .525   1.054

 20   2   2    1.435    1.980   2.593     1.418    1.962   2.570
          3    1.076    1.582   2.140     1.059    1.560   2.117
          4     .838    1.312   1.843      .820    1.291   1.820
      3   2    1.180    1.716   2.320     1.134    1.664   2.257
          3     .818    1.314   1.859      .771    1.257   1.789
          4     .579    1.039   1.566      .530     .980   1.492
      4   2    1.017    1.554   2.171      .933    1.457   2.054
          3     .650    1.152   1.710      .566    1.050   1.578
          4     .413     .875   1.418      .324     .765   1.277

* This choice of d insures that the probability level P* can even be attained over all $\Omega$ no matter what n is used.


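$$\Omega_1 = \{\boldsymbol{\lambda} \mid \lambda_{[k-t+1]} - \lambda_{[k-t]} \ge \delta_1\}$$

$$\Omega_2 = \{\boldsymbol{\lambda} \mid \lambda_{[k-t+1]} \ge \delta_2 \lambda_{[k-t]}\}$$

$$\Omega_t(p) = \Omega_1 \cap \Omega_2 \quad \text{with} \quad \Lambda' = [\delta_1, \infty)$$

$$R(n):\ \text{Select } \pi_i \iff T_{in} \ge \max\{T_{[k-m+1]},\ d^{-1/n}\, T_{[k]}\}$$

and

$$g_n(y|\lambda) = \begin{cases} \dfrac{e^{-\lambda/2}}{\Gamma(q/2)} \displaystyle\sum_{r=0}^{\infty} \dfrac{y^{\frac{p}{2}+r-1}}{(1+y)^{\frac{p+q}{2}+r}}\ \dfrac{\Gamma(\frac{p}{2}+\frac{q}{2}+r)\,\lambda^r}{\Gamma(\frac{p}{2}+r)\,2^r\, r!}, & y \ge 0 \\ 0, & y < 0. \end{cases}$$

Since $\{F_{p,q}(y|\lambda) \mid \lambda \ge 0\}$ is a stochastically increasing family, Theorem 3.1 applies to give $\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] = \inf_{\lambda \in \Lambda'} \beta(\lambda,t,n)$ where $\beta(\lambda,t,n)$ is defined by

$$\beta(\lambda,t,n) = \int_0^\infty \{F_{p,q}(y d^{1/n}|\lambda-\delta_1)\}^{k-t}\ I\!\left(\frac{F_{p,q}(y|\lambda-\delta_1)}{F_{p,q}(y d^{1/n}|\lambda-\delta_1)};\ k-m-t+1,\ m\right) d\{F_{p,q}(y|\lambda)\}^t, \quad \lambda \in I_1$$

$$\beta(\lambda,t,n) = \int_0^\infty \{F_{p,q}(y d^{1/n}|\lambda/\delta_2)\}^{k-t}\ I\!\left(\frac{F_{p,q}(y|\lambda/\delta_2)}{F_{p,q}(y d^{1/n}|\lambda/\delta_2)};\ k-m-t+1,\ m\right) d\{F_{p,q}(y|\lambda)\}^t, \quad \lambda \in I_2$$

where

$$I_1 = [\delta_1,\ \delta_1\delta_2/(\delta_2 - 1)]$$

$$I_2 = [\delta_1\delta_2/(\delta_2 - 1),\ \infty)$$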


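Theorem 5.1. For any $1 < d < \delta_2$ and $n \ge 1$, $\beta(\lambda,t,n)$ is nonincreasing in $\lambda$ on $I_1$ and nondecreasing in $\lambda$ on $I_2$. Hence

$$\inf_{\lambda \in \Lambda'} \beta(\lambda,t,n) = \beta\!\left(\frac{\delta_1\delta_2}{\delta_2 - 1},\ t,\ n\right).$$

Proof. A piecewise application of Theorem 3.2 will be made on $I_1$ and $I_2$. The following argument shows the method as used on $I_1$. Since $dF_{p,q}(y|\lambda)/d\lambda$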

— ^  „(yU)  it  can  be  seen  that  (3.5)  reduces  to 

q-2  p+2 ,q-2 

dl/nfp«,q-2<>'li)fp,q<ydl/n|1  V  i  fp«.q-2(y|A-Sl)-fp.Q<y|l) 

for  all  y  >  0  and  A  e  1^ 


1  fn+9  o  9(ylX)  1  fn+9  n  I' X~6,  > 

-  -Pt2  y<L~2 . . . —  <  * - P*2  >1-2.  y  - —  for  all  y  >  and  A  e  I 

y  #P.q(y|X)  -  T7^  f  (yd^lX-6,) 

P  *4  -L 


1  <  d  <  -> 


¥2  .V 

«2  - 1  d-i 


A  <  fijd/td-l)  V  A  e  I 


\*1/na+y)  d  d1 

*±-77; -  v  y>0  8ince 


V  n  >  1 


d  - 1 


d  -  1 


y(l  *  d1/ny)  >  <1 1  y)d1/n(X  -  61) 


V  v  >  0.  A  e  I.  and  n  >  1  . 
1  fo+2.q-2(ylX)

An  application  of  problem  7.4  in  Lehmann  (1957)  shows  that  —  (v|a1 — 

7  P,Q  Y 

is  +  in  for  fixed  d  and.  q  and  hence  (3.5)  holds.  A  similar 

argument  gives  (3.6)  and  hence  completes  the  proof. 

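Finally since $\delta_1\delta_2/(\delta_2 - 1) \in \Lambda'$ we have for any $1 < d < \delta_2$, $\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] = \beta(\delta_1\delta_2/(\delta_2 - 1),\ t,\ n) \to 1$ as $n \to \infty$. Expressions for $E_{\boldsymbol{\lambda}}[S(n)]$ and $\sup_{\Omega} E_{\boldsymbol{\lambda}}[S(n)]$ are found in Santner (1973).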

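III. Suppose $\pi_i$ has a uniform distribution on $[0, \lambda_i]$, $i = 1,\ldots,k$; then the problem of selecting one of the t populations with largest $\lambda_i$'s is nonregular in the sense that $E_n^{\lambda_i} \ne E_n$ for all $\lambda_i$. Take $T_{in} = \max_{1 \le j \le n} X_{ij}$, $p(\lambda) = \lambda\delta$ with $0 < \delta < 1$ and $h_n(x) = d^{1/n} x$ where $d > 1$; then $T_{in} \to \lambda_i$ in probability as $n \to \infty$ and the problem can be formally stated in the language of Section 2 as

$$\Lambda = (0,\infty)$$

$$G_n(y|\lambda) = \begin{cases} 0, & y < 0 \\ (y/\lambda)^n, & 0 \le y < \lambda \\ 1, & y \ge \lambda \end{cases}$$

$$\Omega_t(p) = \{\boldsymbol{\lambda} \mid \lambda_{[k-t+1]}\,\delta \ge \lambda_{[k-t]}\}$$

$$R(n):\ \text{Select } \pi_i \iff T_{in} \ge \max\{d^{-1/n}\, T_{[k]},\ T_{[k-m+1]}\}.$$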
Theorem 5.2.

$$\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] = 1 + \delta^{nt}\left\{\frac{(t-k)}{d^t\, k}\ I(1/d;\ k-m-t+1,\ m) - \prod_{j=1}^{t}\left(\frac{k-m-t+j}{k-t+j}\right) I(1 - 1/d;\ m,\ k-m+1)\right\} \tag{5.4}$$
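A closed form such as (5.4) can be checked against a direct numerical evaluation of $\beta(\lambda,t,n)$ for the uniform family. The Python sketch below assumes (5.4) in the form $1 + \delta^{nt}\{\frac{t-k}{d^t k} I(1/d; k-m-t+1, m) - \prod_{j=1}^{t} \frac{k-m-t+j}{k-t+j}\, I(1-1/d; m, k-m+1)\}$ and works in the variable $u = (y/\lambda)^n$, in which $G_n(y|\lambda) = u$, $G_n(y|\lambda\delta) = \min(u/\delta^n, 1)$ and $G_n(h_n(y)|\lambda\delta) = \min(du/\delta^n, 1)$. Function names are hypothetical.

```python
from math import comb, prod


def inc_beta_int(x: float, a: int, b: int) -> float:
    """Regularized incomplete beta I(x; a, b) for integers a, b >= 1."""
    n = a + b - 1
    return sum(comb(n, j) * x**j * (1.0 - x)**(n - j) for j in range(a, n + 1))


def inf_pcs_uniform(k: int, m: int, t: int, d: float, delta: float, n: int) -> float:
    """Closed form (5.4) under the assumptions stated in the lead-in."""
    first = (t - k) / (d**t * k) * inc_beta_int(1.0 / d, k - m - t + 1, m)
    ratio = prod((k - m - t + j) / (k - t + j) for j in range(1, t + 1))
    second = ratio * inc_beta_int(1.0 - 1.0 / d, m, k - m + 1)
    return 1.0 + delta ** (n * t) * (first - second)


def beta_uniform_quadrature(k: int, m: int, t: int, d: float, delta: float, n: int,
                            steps: int = 20000) -> float:
    """Midpoint-rule evaluation of beta(lambda, t, n) in the scale u = (y / lambda)^n."""
    c = delta**n
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) / steps
        lower = min(u / c, 1.0)        # G_n(y | lambda * delta)
        upper = min(d * u / c, 1.0)    # G_n(h_n(y) | lambda * delta)
        total += (upper ** (k - t) * inc_beta_int(lower / upper, k - m - t + 1, m)
                  * t * u ** (t - 1))
    return total / steps


# For k = 3, m = 1, t = 1 the rule keeps only the largest statistic and the
# probability is 1 - 2 * delta^n / 3, whatever d is.
print(inf_pcs_uniform(3, 1, 1, d=2.0, delta=0.5, n=1))
print(inf_pcs_uniform(6, 2, 2, d=1.5, delta=0.8, n=3),
      beta_uniform_quadrature(6, 2, 2, d=1.5, delta=0.8, n=3))
```

The integrand splits naturally at $u = \delta^n/d$ and $u = \delta^n$, which corresponds to writing $\beta$ as a sum of three terms.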


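Proof. Since $\{G_n(y|\lambda) \mid \lambda \in (0,\infty)\}$ is a stochastically increasing family for each n, Theorem 3.1 applies to give

$$\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] = \inf_{\lambda > 0} \beta(\lambda,t,n)$$

where

$$\beta(\lambda,t,n) = T_1(\lambda) + T_2(\lambda) + T_3(\lambda)$$

and

$$T_1(\lambda) = \sum_{j=k-t-m+1}^{k-t} \binom{k-t}{j} \int_0^{\lambda\delta/d^{1/n}} \left(\frac{y}{\lambda\delta}\right)^{nj} \left\{(d-1)\left(\frac{y}{\lambda\delta}\right)^{n}\right\}^{k-t-j} d\left(\frac{y}{\lambda}\right)^{nt}.$$

A change of variables and a lengthy but routine calculation shows all $T_i(\lambda)$'s to be independent of $\lambda$ and gives (5.4) completing the proof.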

Either direct examination of (5.4) or an appeal to Theorem 4.1 with $\lambda_0 = 1$ shows $\inf_{\Omega_t(p)} P_{\boldsymbol{\lambda}}[CS|R(n)] \to 1$ as $n \to \infty$. Hence any $(P^*, p(\lambda))$ requirement (2.2) can be achieved. All usual monotonicity properties hold for R(n) and details of their derivation can be found in Santner (1973). In particular,

$$\sup_{\Omega} E_{\boldsymbol{\lambda}}[S(n)] = k\left\{\left(1 - \frac{k-1}{dk}\right) I(1/d;\ k-m,\ m) + \frac{m}{k}\ I(1 - 1/d;\ m+1,\ k-m)\right\}.$$
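The expected subset size for the uniform rule can be checked numerically in the same way. The sketch below is illustrative only: it assumes the supremum is attained when all $\lambda_i$ are equal, takes the closed form to be $k\{(1 - \frac{k-1}{dk}) I(1/d; k-m, m) + \frac{m}{k} I(1-1/d; m+1, k-m)\}$, and compares it with a midpoint-rule evaluation of $k \int_0^1 \min(du,1)^{k-1}\, I(u/\min(du,1); k-m, m)\, du$, where $u = (y/\lambda)^n$. Function names are hypothetical.

```python
from math import comb


def inc_beta_int(x: float, a: int, b: int) -> float:
    """Regularized incomplete beta I(x; a, b) for integers a, b >= 1."""
    n = a + b - 1
    return sum(comb(n, j) * x**j * (1.0 - x)**(n - j) for j in range(a, n + 1))


def sup_size_uniform(k: int, m: int, d: float) -> float:
    """Closed form for sup E[S(n)] under the assumptions in the lead-in; needs m < k."""
    first = (1.0 - (k - 1) / (d * k)) * inc_beta_int(1.0 / d, k - m, m)
    second = (m / k) * inc_beta_int(1.0 - 1.0 / d, m + 1, k - m)
    return k * (first + second)


def sup_size_quadrature(k: int, m: int, d: float, steps: int = 20000) -> float:
    """Direct midpoint-rule evaluation with all k parameters equal."""
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) / steps
        upper = min(d * u, 1.0)
        total += upper ** (k - 1) * inc_beta_int(u / upper, k - m, m)
    return k * total / steps


# For k = 3, m = 2 a direct argument gives 2 - 1 / d^2.
print(sup_size_uniform(3, 2, d=2.0))  # 1.75
print(sup_size_uniform(5, 2, d=1.7), sup_size_quadrature(5, 2, d=1.7))
```

Note that the expression does not involve n: the scale change $u = (y/\lambda)^n$ removes it, so the bound on the expected subset size is controlled by d and m alone.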